Materialized View Selection for Multidimensional Datasets
Authors
Abstract
To fulfill the requirement of fast interactive multidimensional data analysis, database systems precompute aggregate views on some subsets of dimensions and their corresponding hierarchies. However, the problem of what to precompute is difficult and intriguing. The leading existing algorithm, BPUS, has a running time that is polynomial in the number of views and is guaranteed to be within (0.63 - f) of optimal, where f is the fraction of available space consumed by the largest aggregate. Unfortunately, BPUS can be impractically slow, and in some instances may miss good solutions due to the coarse granularity at which it decides what to precompute. In view of this, we study the structure of the precomputation problem and show that under certain broad conditions on the multidimensional data, an even simpler and faster algorithm, PBS, achieves the same (0.63 - f) bound. Our empirical study of the behavior of PBS shows that even when these conditions do not hold, PBS picks a surprisingly good set of aggregates for precomputation. Furthermore, BPUS and other previous work have assumed that all aggregates are either precomputed in their entirety or not at all. We show that if one relaxes this assumption and allows aggregates to be partially precomputed, not only is it possible to find solutions that are better than those found by previous algorithms, in some cases it is even possible to find solutions that are better than the solution that is ‘optimal’ by the previous definition.

This research is supported by a gift from NCR Corp., and by ARPA through Rome Air Force Laboratory contract F30602-972-0247.

Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the VLDB copyright notice and the title of the publication and its date appear, and notice is given that copying is by permission of the Very Large Data Base Endowment. To copy otherwise, or to republish, requires a fee and/or special permission from the Endowment.

Proceedings of the 24th VLDB Conference, New York, USA, 1998
Jeffrey F. Naughton
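The abstract describes PBS only as a simpler and faster alternative to BPUS. A minimal sketch of the idea, assuming PBS stands for "pick by size" (take aggregates in increasing order of size until the next one no longer fits in the space budget), might look like the following. The function name, the view names, and the sizes are hypothetical illustrations and are not taken from the paper.

```python
from typing import Dict, List


def pick_by_size(view_sizes: Dict[str, int], space_budget: int) -> List[str]:
    """Select views smallest-first until the next view does not fit.

    Assumption: this mirrors the 'pick by size' idea only; it is not
    the authors' code and ignores partial precomputation.
    """
    selected: List[str] = []
    remaining = space_budget
    # Sort by size, breaking ties by name so the result is deterministic.
    for name, size in sorted(view_sizes.items(), key=lambda kv: (kv[1], kv[0])):
        if size > remaining:
            break  # stop at the first view that exceeds the remaining space
        selected.append(name)
        remaining -= size
    return selected


# Hypothetical sizes (in rows) for the aggregates of a three-dimension cube.
sizes = {
    "none": 1, "a": 10, "b": 20, "c": 50,
    "ab": 150, "ac": 400, "bc": 800, "abc": 1000,
}
chosen = pick_by_size(sizes, space_budget=300)
print(chosen)
```

Unlike a benefit-per-unit-space greedy such as BPUS, this sketch never re-evaluates the benefit of the remaining views after each pick; a single sort determines the whole selection, which is where the speed advantage would come from.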
Similar Resources
Materialized View Selection in a Multidimensional Database
A multidimensional database is a data repository that supports the efficient execution of complex business decision queries. Query response can be significantly improved by storing an appropriate set of materialized views. These views are selected from the multidimensional lattice whose elements represent the solution space of the problem. Several techniques have been proposed in the past to pe...
Materialized Views Selection in a Multidimensional Database
A multidimensional database is a data repository that supports the efficient execution of complex business decision queries. Query response can be significantly improved by storing an appropriate set of materialized views. These views are selected from the multidimensional lattice whose elements represent the solution space of the problem. Several techniques have been proposed in the past to perf...
Optimizing Aggregate SPARQL Queries Using Materialized RDF Views
During recent years, more and more data has been published as native RDF datasets. In this setup, both the size of the datasets and the need to process aggregate queries represent challenges for standard SPARQL query processing techniques. To overcome these limitations, materialized views can be created and used as a source of precomputed partial results during query processing. However, materi...
A Solution to View Management to Build a Data Warehouse
Several techniques exist to select and materialize a proper set of data in a suitable structure that manages the queries submitted to the online analytical processing systems. These techniques are called view management techniques, which consist of three research areas: 1) view selection to materialize, 2) query processing and rewriting using the materialized views, and 3) maintaining materializ...
Automated Selection of Materialized Views and Indexes in SQL Databases
Automatically selecting an appropriate set of materialized views and indexes for SQL databases is a non-trivial task. A judicious choice must be cost-driven and influenced by the workload experienced by the system. Although there has been work in materialized view selection in the context of multidimensional (OLAP) databases, no past work has looked at the problem of building an industry-streng...
Materialized Views in Multidimensional Databases
This chapter presents materialized views in the context of multidimensional databases (MDDBs). A materialized view is a view whose content is explicitly stored in the database. The advantage of materializing views is that it is not necessary to recompute the query every time the view is accessed. The shortcoming is that it has to be kept consistent with the updates on the base tables. However, ...
Journal title:
Volume, issue:
Pages: -
Publication date: 1998